|
|
Accession Number |
TCMCG075C30090 |
gbkey |
CDS |
Protein Id |
XP_017985035.1 |
Location |
join(17820111..17820157,17820263..17820359,17820457..17820578,17823880..17823972,17824142..17824214,17824295..17824385,17824477..17824529,17825255..17825319,17826019..17826124,17826252..17826362,17827476..17827535,17829252..17829290,17829501..17829551,17829634..17829732,17830443..17830502,17830605..17830697,17831522..17831626,17831722..17831832,17831972..17832088,17833562..17833678,17834780..17834860,17836409..17836513,17836591..17836671,17836761..17836892,17836986..17837030,17837583..17837660,17837730..17837869,17837995..17838085,17840770..17840894,17840980..17841115,17841225..17841355,17842537..17842662,17842766..17842821,17843684..17843754,17844409..17844493,17844586..17844689,17845328..17845456,17846554..17846604) |
Gene |
LOC18587075 |
GeneID |
18587075 |
Organism |
Theobroma cacao |
|
|
Length |
1158aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_018129546.1
|
Definition |
PREDICTED: DNA-directed RNA polymerase III subunit 2 isoform X1 [Theobroma cacao] |
CDS: ATGGGTCTCAAGCAAGAAGATTTACTTCTCAATAACAACACCAATAATTCAAATGTTTCTGAGATGCCCGTTGATAAGCAAAAGCTTGCTGCTCCAATAAAATCAGCTGTTGATAAGTTTCAGCTGCTACCTGAGTTTTTAAAGGTTAGAGGATTAGTGAAGCAACACTTGGATTCATTTAACTATTTTGTTAATACTGGAATAAAGAAGATTGTCCGTGCCAATGACCGGATTGTATCCGGTGTTGACCCCAGTATTTACCTCAGGTTTAAAGATGTTAGAATTGGTGAGCCCTCCATGACGATCAATGCAGTCAGTGAAAAAATAAATCCCCATACATGCCGGTTGTCAGACATGACGTATGCTGCACCGATATTTGTCAATATAGAGTACATTCAAGGGAGTCATGGGCAGAAAACTAGACTGGAAAAGAATGATCTTGTTATTGGAAGAATGCCTATCATGTTAAGGAGTTGTTGCTGTGTGTTATATGGAAAAGACGAAGCTGAACTTGCGAGGCTTGGTGAGTGCCCCCTTGATCCTGGAGGATACTTTGTCATTAAAGGAGCAGAGAAGGTGATTTTAATACAGGAACAGCTTTCCAAGAATAGAATAATCATTGATGCAGACAAGAAGGGAAATATAAATGCATCTGTGACAAGCAGTTCAGAGGCAACAAAAAGCAAAACAGTTATTCAGATGGAGAAAGAAAAGATATATTTACTTCTCAATCAATTTGTGAAAAAGATCCCTATTATGGTGGTCATGAAGGCAATGGGGATGGAGAGTGATCAAGAGGTTGTGCAGATGGTTGGTAGAGATCCTCATTATAATGCCGTTCTTTTGCCTTCTATAGAGGAATGTGCAGGAGTTGGCATTTATACTCAGGAACAAGCACTGGAGTACCTAGAGACGAAGGTGAAAAGAGTTATGTACACTGGTCCTGCATCTGAGAAGGAAGGAAGAGCTCTGTCTATCCTTCGAGATGTATTTCTTGCCAACGTTCCAGTGCGTTCTAATAATTTTCGTCCAAAATGTTTGTATGTTGCGGTAATGCTGAGAAGGATGGTGGAGGCAATTTTAAATAAGGATGCGATGGATGACAAGGATTATGTGGGGAACAAGCGTCTAGAGCTATCAGGACAATTAATCTCTCTGCTTTTTGAGGATTTGTTCAAGACAACGATTAGCGAAGTGCAGAAGATGATTGATCTCGTGTTATCAAAGCCCAGCAGATCTAGTGCTTTGGACCCTTCTCAGTTTTTACGTAGTAGAGAGACCATTACGTTTGGGCTAGAAAGGACCCTTTCTACTGGTAACTTCGATATAAAGCGTTTCAAAATGCACAGAAAAGGCATGACACAGGTGCTAGCAAGGTTATCCTTTATTGGGACTTTGGGCTATATGACAAAAGTCTCACCACAGTTTGAGAAGTCTCGGAAAGTAAGTGGACCGAGGGCCTTGCAACCTAGCCAGTGGGGAATGCTTTGCCCTTGTGATACTCCTGAAGGTGAAGCTTGTGGACTGGTTAAAAACTTAGCACTAATGACTCATGTTACAACTGATGAGGATGAGGGTCCTTTGATTTCTCTGTGCTATTGCTTGGGCGTTGAAGACTTGGAGCTACTATCTGGGGAAGAGCTTCATACACCAAATTCTTTCCTAGTTATATTAAATGGGCTCATTCTTGGCAAACATAGACGGCCACAGCATTTTGCTGTGGCTATGAGAAAGCTGCGGAGAGCTGGCAAAGTTGGTGAGTTTGTGAGTGTCTTCGTAAATGAGAAGCAGCGTTGTGTTTACATTGCTTCTGATGGAGGTCGAGTGTGTCGACCGTTGGTAATAGCTGACAAGGGAGTATCAAGGATCAAAGAACACCATATGAAGGAGTTATTGGATGGAGTCCGCACTTTTGATGACTTTTTACGTGATGGATTGATTGAATATCTTGATGTCAATGAGGAGAACAATGCTCTGATTGCTTTATATGAAGGAGAGGCTACACCTGAAACAACCCATATCGAGATAGAGCCTTTCACGATCTTAGGTGTTTGTGCTGGGCTTATTCCATATCCTCATCATAATCAGTCACCAAGAAATACCTATCAGTGTGCAATGGGGAAGCAAGCAATGGGAAATATTGCATATAACCAGTTGTGCCGGATGGACACATTACTATATCTATTGGTGTATCCTCAGCGGCCTTTATTGACAACGAGGACAATTGAACTGGTTGGATATGATAAGCTTGGAGCTGGTCAGAATGCGATTGTTGCTGTGATGAGTTATAGTGGGTATGACATAGAGGACGCAATTGTCATGAACAAGTCTTCTCTAGACCGTGGTTTTGGTCGTTGTATTGTGATGAAAAGGTATTCTGCCGTTAATCAAAAATATGAAACTGGTGCATCTGATAGAATACTTAGGCCACAGAGAACAGGACCTGGTTCAGAAAGGATGCAGATATTAGATGATGATGGAATTGCTACTCCTGGGGAGATTATTAGACCAAATGATATCTACATTAATAAGGAGTCCTCTATTCATACAAGAGGATCTCGTGTATCTTCAGAATCTCTACCTGATAGCGCATATAGACCTGCTAGGCAAACATACAAAGGTCCTGAAGGAGAGTCTTGTGTGGTGGATAGAGTTGCTCTTTGCACTGATAGGAACAGCAATCTATCTATTAAATTTTTAATACGCCATACACGTCGACCTGAGGTTGGTGACAAATTTAGTAGCAGACATGGCCAGAAAGGTGTTTGTGGCACTATCATTCAGCAGGAAGATTTCCCATTTTCTGAGCGTGGCATTTGTCCTGATTTAATTATGAATCCTCATGGTTTTCCAAGTCGAATGACTGTAGGTAAGATGGTAGAGCTTCTTGGAGGCAAAGCTGGAGTATCATGTGGTAGGTTCCATTATGGTAGTGCCTTTGGGGAGCCTAGTGGTCATGCGGATAGGGTTGAAGCTATAAGTGAAACCCTCATCAAGCATGGTTTTAGCTACAATGGCAAGGACTTCATTTATTCAGGTATTACAGGTTGTCCACTGCAAGCATATATTTTTATGGGACCAATTTACTACCAGAAGTTGAAGCATATGGTCCTTGACAAAATGCATGCCAGAGGTAATGGGCCTCGAGTTATGCTAACTAGACAGCCTACAGAAGGGAGAGCTCGAAATGGAGGGTTACGAGTAGGAGAAATGGAACGTGATTGTCTAATTGCTTATGGTGCTAGCATGTTGATTTTCGAGCGCCTGATGATTTCCAGTGATCCTTTTGAAGTTCAGGTTTGCAGAAAATGTGGTTTGTTAGGATACTACAGCCATAAGCTGAAAACTGGGATTTGTTCTTCATGTAAAAATGGCGATAATGTTTCTACTATGAAGCTACCATACGCATGCAAGCTTTTGATCCAGGAACTCCAGTCAATGAACATTGTGCCACGTTTGAAACTATCAGAGGCTTGA |
Protein: MGLKQEDLLLNNNTNNSNVSEMPVDKQKLAAPIKSAVDKFQLLPEFLKVRGLVKQHLDSFNYFVNTGIKKIVRANDRIVSGVDPSIYLRFKDVRIGEPSMTINAVSEKINPHTCRLSDMTYAAPIFVNIEYIQGSHGQKTRLEKNDLVIGRMPIMLRSCCCVLYGKDEAELARLGECPLDPGGYFVIKGAEKVILIQEQLSKNRIIIDADKKGNINASVTSSSEATKSKTVIQMEKEKIYLLLNQFVKKIPIMVVMKAMGMESDQEVVQMVGRDPHYNAVLLPSIEECAGVGIYTQEQALEYLETKVKRVMYTGPASEKEGRALSILRDVFLANVPVRSNNFRPKCLYVAVMLRRMVEAILNKDAMDDKDYVGNKRLELSGQLISLLFEDLFKTTISEVQKMIDLVLSKPSRSSALDPSQFLRSRETITFGLERTLSTGNFDIKRFKMHRKGMTQVLARLSFIGTLGYMTKVSPQFEKSRKVSGPRALQPSQWGMLCPCDTPEGEACGLVKNLALMTHVTTDEDEGPLISLCYCLGVEDLELLSGEELHTPNSFLVILNGLILGKHRRPQHFAVAMRKLRRAGKVGEFVSVFVNEKQRCVYIASDGGRVCRPLVIADKGVSRIKEHHMKELLDGVRTFDDFLRDGLIEYLDVNEENNALIALYEGEATPETTHIEIEPFTILGVCAGLIPYPHHNQSPRNTYQCAMGKQAMGNIAYNQLCRMDTLLYLLVYPQRPLLTTRTIELVGYDKLGAGQNAIVAVMSYSGYDIEDAIVMNKSSLDRGFGRCIVMKRYSAVNQKYETGASDRILRPQRTGPGSERMQILDDDGIATPGEIIRPNDIYINKESSIHTRGSRVSSESLPDSAYRPARQTYKGPEGESCVVDRVALCTDRNSNLSIKFLIRHTRRPEVGDKFSSRHGQKGVCGTIIQQEDFPFSERGICPDLIMNPHGFPSRMTVGKMVELLGGKAGVSCGRFHYGSAFGEPSGHADRVEAISETLIKHGFSYNGKDFIYSGITGCPLQAYIFMGPIYYQKLKHMVLDKMHARGNGPRVMLTRQPTEGRARNGGLRVGEMERDCLIAYGASMLIFERLMISSDPFEVQVCRKCGLLGYYSHKLKTGICSSCKNGDNVSTMKLPYACKLLIQELQSMNIVPRLKLSEA |